Image processing method and apparatus therefor.



(57) A character recognition method capable of determining the direction of rows of characters, applicable
to a language which can be written in the horizontal or vertical direction, such as Japanese, is
disclosed. The method takes projections in both directions, by counting black pixels, and compares the
obtained counts with a threshold value. The text is identified as horizontally or vertically written,
respectively, if the counts exceed the threshold value more often in the horizontal or vertical direction.
Projection is taken again in the perpendicular direction, after the writing direction is determined, to
extract each character, which is subjected to character recognition in a known manner. Thus the method
can achieve automatic recognition without the operator having to specify the direction of writing.

BACKGROUND OF THE INVENTION

Field of the Invention 

The present invention relates to an image proces- 
sing method for receiving text image information in 
which horizontal or vertical writing is not identified, 
and recognizing characters etc. in said text image 
information, and an apparatus therefor. 

Related Background Art 

Conventional character recognition apparatus is 
composed of an input unit for reading a text image, a 
pre-treatment unit for extracting a character from the 
input image and applying various normalizations to 
the extracted image, and a recognition unit for extract- 
ing geometrical features from the normalized pattern 
and comparing said features with standard patterns 
stored in advance in a dictionary, thereby recognizing 
the character portion of said text image. Since the 
Japanese text may be present in vertical writing, in 
addition to the ordinary horizontal writing, the pre- 
treatment unit for extracting the character pattern in 
the character recognition apparatus for Japanese 
characters has been designed, for a horizontally writ- 
ten text image, to extract a row of characters by taking 
a horizontal image projection, and to extract a charac- 
ter by taking a vertical image projection of thus extrac- 
ted row or rows. Also for a vertically written text image, 
a row of characters is extracted by taking a vertical 
image projection, and a character is extracted by tak- 
ing a horizontal image projection on thus extracted 
row or rows. 

Also, for designating a vertically or horizontally
written area in text image information, there is already
known an apparatus which allows the user, on a display
of the entered image information, to designate
an area therein and whether said area is vertically or
horizontally written.

For this reason, the character recognition
apparatus for Japanese characters employs different
processes for horizontal writing and for vertical
writing. It has therefore been necessary to prepare a
character recognition apparatus exclusively for horizontal
or vertical writing, or to have the user designate
the horizontal or vertical writing prior to the character
recognition.

In consideration of the drawbacks of the prior 
technology explained above, the object of the present 
invention is to provide a character recognition 
apparatus capable of character recognition, by 
detecting whether the text image is vertically or hori- 
zontally written, based on said text image. 

Also in the above-explained prior technology, if
vertically and horizontally written texts are present
in mixed manner in the image information, as in the case of
magazines, the user is required to execute a cumbersome
operation of designating each vertically or horizontally
written area and classifying each area as vertically
or horizontally written.

SUMMARY OF THE INVENTION

The object of the present invention is to resolve
the drawbacks in the above-explained prior technology,
and to discriminate whether the text of input
image information is vertically or horizontally written,
by taking histograms in the vertical and horizontal
directions on the entered text image information, and
comparing thus prepared histograms with suitable
reference values or investigating the relationships of
said histograms in the vertical and horizontal directions.

Also in image information containing a vertically
written text and a horizontally written text in mixed
manner, the detection of areas and of the mode of writing
in each area can be achieved without effort by the
operator, by partially analyzing the histograms in each
of the horizontal and vertical directions.

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a block diagram of an image processing apparatus embodying the present invention;
Fig. 2 is a view showing an example of projections of input text image information in said embodiment;
Fig. 3 is a first flow chart of the recognition process of said embodiment;
Fig. 4 is a first flow chart of a process for detecting the direction of row of characters;
Fig. 5 is a second flow chart of the process for detecting the direction of row of characters;
Fig. 6 is a third flow chart of the process for detecting the direction of row of characters;
Fig. 7 is a fourth flow chart of the process for detecting the direction of row of characters;
Fig. 8 is a fifth flow chart of the process for detecting the direction of row of characters;
Fig. 9 is a view showing a second example of projections of the input text image information;
Fig. 10 is a second flow chart of the character recognition process;
Fig. 11 is a sixth flow chart of the process for detecting the direction of row of characters;
Fig. 12 is a seventh flow chart of the process for detecting the direction of row of characters;
Fig. 13 is a first view for explaining an embodiment 3.1;
Fig. 14 is a second view for explaining the embodiment 3.1;
Fig. 15 is a third view for explaining the embodiment 3.1;
Fig. 16 is a fourth view for explaining the embodiment 3.1;
Fig. 17 is an eighth flow chart of the process for detecting the direction of row of characters;
Fig. 18 is a view showing a third example of projections of the input text image information;
Fig. 19 is a third flow chart of the character recognition process;
Fig. 20 is a ninth flow chart of the process for detecting the direction of row of characters;
Fig. 21 is a view showing area separation by comparison of histograms with reference values; and
Fig. 22 is a tenth flow chart of the process for detecting the direction of row of characters.

DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

Now the present invention will be clarified in detail 
by preferred embodiments thereof shown in the 
attached drawings. 

[Description of character recognition apparatus (Fig. 1)]



Fig. 1 is a schematic block diagram of a character recognition apparatus embodying the present invention, common to all the embodiments.

In Fig. 1 there are shown the entire character recognition apparatus 1 of the present embodiment; an input unit 2, composed for example of a scanner, for reading a text image and generating digital original image data; a central processing unit (CPU) 3 for controlling the entire apparatus; a ROM 4 storing a control program for the CPU 3 corresponding to flow charts to be explained later and other various data; and a memory (RAM) 5 used as a work area of the CPU 3 and also having an area for storing the text image data entered from the input unit 2.

There are further shown a character row determination unit 6 for determining the direction of row of characters in the text image stored in the memory 5, namely whether said text is vertically or horizontally written, and extracting respective areas if vertical and horizontal writings are mixedly present as shown in Fig. 18; a pre-processing unit 7 for extracting the pattern of a character from the text image stored in the memory 5 and effecting various normalizations; a recognition unit 8 for extracting geometrical features from the pattern pre-processed by the pre-processing unit 7 and comparing said features with standard patterns stored in advance in a dictionary 9 for recognition; a display unit 10 for displaying, for example, the result of recognition on the input text image; a system bus 11 including a data bus, an address bus, a control signal bus etc. from the CPU 3; and an interface unit 12 for interface control with an external output unit such as a printer 13.



[Description of text image (Fig. 2)] 

Fig. 2 shows an example of text image entered 
into the character recognition apparatus of the pre- 
sent embodiment. 

In Fig. 2 there are shown a text image 21 read by 
the input unit 2; a projection 22 of the text image 21 
(count of black pixels) when it is scanned in the hori- 
zontal direction; a similar projection 23 of said text 
image 21 obtained by vertical scanning; and lines 24,
25 indicating a reference value a.

In the example shown in Fig. 2, the entered text 
image 21 is horizontally written, and it will be under- 
stood that the number of lines in which the count of 
black pixels exceeds the reference value a is larger 
in the horizontal direction than in the vertical direction. 

[1.1 Embodiment of counting black pixels in each 
row, utilizing the reference value a] 

Fig. 3 is a flow chart of the control sequence of the
CPU 3 in the present embodiment, and a corresponding
control program is stored in the ROM 4.

At first a step S0 reads a text image, set by the
operator, by the input unit 2 (scanner), and stores it
in the memory 5. In a next step S2, the character row
determination unit 6 discriminates whether the text
image stored in the memory 5 is horizontally or vertically
written, by calculating the projections (counts of
black pixels) of said text image in the horizontal and
vertical directions, then determining the ratio of the number
of lines in which said pixel count exceeds the
reference value a to the total number of lines, and
comparing said ratio for the horizontal direction with
that for the vertical direction.

Fig. 4 shows the details of said step S2 executed
by the determination unit 6. At first a step S11 counts
the number of black pixels for each horizontal line,
and a step S12 determines the number of lines in
which the obtained count is equal to or larger than the
reference value a. This operation is conducted for all
the horizontal lines (number TH) of the text image,
and the number UH of lines in which the count is equal
to or larger than the reference value a is determined.
Then the ratio RH of the number UH of lines in which
the pixel count is equal to or larger than the reference
value a to the total line number TH is calculated by:

RH = UH/TH.

Then a step S13 counts the number of black
pixels in each vertical line, and a step S14 discriminates
whether the count is at least equal to the reference
value a. This operation is conducted for all the
vertical lines (number TV), and the number UV of
lines in which the count is equal to or larger than the
reference value a is determined. Then the ratio RV of
said line number UV to the total line number TV is
calculated by:

RV = UV/TV.

Then a step S15 compares the obtained ratio RH
for the horizontal direction with that RV for the vertical
direction. If a condition RH ≥ RV is satisfied, the text
image is identified as horizontally written.
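
The ratio comparison of steps S11 to S15 can be sketched in Python as follows. This is only a minimal illustration under stated assumptions, not the implementation of the apparatus: the image is assumed to be a list of rows of 0/1 values (1 = black pixel), the names `line_ratio` and `is_horizontal` are hypothetical, and the final comparison is taken to be RH >= RV.

```python
def line_ratio(lines, a):
    """Fraction of lines whose black-pixel count is at least the reference value a."""
    counts = [sum(line) for line in lines]       # projection: black pixels per line
    hits = sum(1 for c in counts if c >= a)      # UH (or UV)
    return hits / len(lines)                     # RH = UH/TH (or RV = UV/TV)


def is_horizontal(image, a):
    """Return True if the text image is judged to be horizontally written."""
    rows = image                                 # horizontal lines
    cols = [list(col) for col in zip(*image)]    # vertical lines
    rh = line_ratio(rows, a)
    rv = line_ratio(cols, a)
    return rh >= rv                              # assumed reading of step S15


# Hypothetical 4 x 8 image: two text rows separated by blank rows.
page = [
    [1, 1, 0, 1, 1, 0, 1, 1],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [1, 1, 0, 1, 1, 0, 1, 1],
    [0, 0, 0, 0, 0, 0, 0, 0],
]
horizontal = is_horizontal(page, a=3)
```

Transposing `page` models the same text written vertically, in which case the function returns the opposite result.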

Referring again to Fig. 3, if the text image is
horizontally written, the sequence proceeds to a step
S3 to effect pre-processing for horizontal writing in the
pre-processing unit 7. Said pre-processing for horizontal
writing consists of character extraction by extracting
a row of characters by taking a projection in the
horizontal direction and extracting a character by taking
a projection in the vertical direction on the extracted
row(s) of characters, normalization of position,
inclination, line width and density of the character, and
noise elimination.

On the other hand, if the condition RH ≥ RV is not
satisfied in the step S2, the text image stored in the
memory 5 is identified as vertically written, and a step
S4 executes pre-processing for vertical writing in the
pre-processing unit 7. Said pre-processing consists of
character extraction by extracting a row of characters
by taking a projection in the vertical direction and
extracting a character by taking a projection in the
horizontal direction on the extracted row(s) of characters,
normalization of position, inclination, line width and
density of the character, and noise elimination.
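
The two-stage character extraction described above (a projection to find rows, then a perpendicular projection inside each row) might be sketched as follows. The sketch assumes a list-of-rows image of 0/1 values and treats any line with a zero projection as a gap; the names `runs` and `extract_characters` are hypothetical, and normalization and noise elimination are not covered.

```python
def runs(profile):
    """Return (start, end) pairs, end-exclusive, of consecutive non-zero entries."""
    spans, start = [], None
    for i, value in enumerate(profile):
        if value and start is None:
            start = i                            # a run of text begins
        elif not value and start is not None:
            spans.append((start, i))             # the run ends at a gap
            start = None
    if start is not None:
        spans.append((start, len(profile)))
    return spans


def extract_characters(image, horizontal=True):
    """Return character boxes as (top, bottom, left, right), end-exclusive."""
    if not horizontal:
        # Vertical writing: transpose, reuse the horizontal logic, swap axes back.
        transposed = [list(col) for col in zip(*image)]
        return [(l, r, t, b) for (t, b, l, r) in extract_characters(transposed)]
    boxes = []
    row_profile = [sum(row) for row in image]            # horizontal projection
    for top, bottom in runs(row_profile):                # one run = one row of text
        band = image[top:bottom]
        col_profile = [sum(col) for col in zip(*band)]   # vertical projection in the row
        for left, right in runs(col_profile):
            boxes.append((top, bottom, left, right))
    return boxes
```

Reusing the horizontal routine on the transposed image keeps the two pre-processing paths symmetric, which mirrors how the two procedures in the text differ only in the order of the projections.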

Then, in step S5, the recognition unit 8 effects ext- 
raction of features such as contour line on each 
unknown character pattern pre-processed in the pre- 
processing unit 7. The dictionary 9 stores the features 
of standard patterns of all the characters to be iden- 
tified. The feature of the unknown character pattern is 
matched with all the features of the standard patterns 
stored in the dictionary 9, and a character pattern of 
the highest probability is stored as the result of recog- 
nition in the memory 5. All the characters of the input 
text image are similarly recognized and stored in the 
memory 5. A next step S6 displays the results of rec- 
ognition, stored in the memory 5, on the display unit 
10. An output process of characters by the printer 13 
is omitted for the purpose of simplicity. 
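
As a rough illustration of the matching in step S5, the following sketch picks the dictionary entry whose stored feature is closest to the feature of the unknown pattern. The feature representation (a tuple of numbers), the squared Euclidean distance and the name `recognize` are all assumptions made for the example; the text only states that features such as contour lines are matched against the standard patterns.

```python
def recognize(feature, dictionary):
    """Return the character whose standard feature is closest to the given feature."""
    def distance(u, v):
        # Squared Euclidean distance between two feature tuples (assumed metric).
        return sum((x - y) ** 2 for x, y in zip(u, v))

    return min(dictionary, key=lambda ch: distance(feature, dictionary[ch]))


# Hypothetical two-entry dictionary with two-dimensional features.
standard_patterns = {"A": (1.0, 0.0), "B": (0.0, 1.0)}
result = recognize((0.9, 0.2), standard_patterns)
```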

As explained in the foregoing, this embodiment
provides a character recognition apparatus which can
adapt to texts in both horizontal and vertical writing and is
therefore easy to use.

[1.2 Variation of 1.1: embodiment with counting 
from vertical direction] 

Fig. 5 shows the details of the step S2, in Fig. 3,
for discriminating whether the text image is horizontally
or vertically written. In this embodiment, the order
of sequence of the steps S11, S12 and the steps S13,
S14 is inverted in comparison with the foregoing
embodiment 1.1. More specifically, the embodiment
1.1 counts the black pixels at first in the horizontal
direction and then in the vertical direction, whereas
the present embodiment effects said counting at first
in the vertical direction and then in the horizontal
direction. The structure of the apparatus in the present
embodiment is the same as shown in Fig. 1.

The present embodiment can provide the same
effects and advantages as in the foregoing embodiment 1.1.

[1.3 Variation of 1.1: Embodiment for counting 
white pixels] 

Fig. 6 is a flow chart of the present embodiment
in which the counting of black pixels in the step S2 in
the embodiment 1.1 is replaced by counting of white
pixels. In this embodiment, the structure of the
character recognition apparatus and the control sequence
other than said step S2 are identical to those of
the embodiment 1.1, shown in Figs. 1 and 3.

At first a step S31 counts the white pixels in each
horizontal line, and a step S32 discriminates whether
the obtained count is at least equal to the reference
value a, then determines the number of lines in which
the obtained count is at least equal to the reference
value a over the entire input image, and calculates the
ratio RH of the number of such lines to the entire number
of lines.

Then a step S33 counts the white pixels in each
vertical line, and a step S34 discriminates whether the
obtained count is at least equal to the reference value
a, then determines the number of lines in which the
obtained count is at least equal to the reference value
a over the entire input image, and calculates the ratio
RV of the number of such lines to the entire number
of lines.

A next step S35 compares RH and RV, and the
text image is identified as horizontally written or vertically
written, respectively if a condition RH ≥ RV is
satisfied or not.

Also the present embodiment explained above
can provide similar effects and advantages to those
in the foregoing embodiment.

[1.4 Variation of 1.1: Embodiment for counting 
plural lines] 

Fig. 7 is a flow chart of the present embodiment,
in which the counting of black pixels in the step S2 of
the embodiment 1.1 is replaced by counting of black
pixels in plural lines. Also in this embodiment, the
structure of the character recognition apparatus and
the control sequence except for said step S2 are identical
to those of the embodiment 1.1, shown in Figs. 1
and 3.

At first a step S41 counts the black pixels in each
group of plural lines, and a step S42 discriminates whether the
obtained count is at least equal to a reference value
a, then determines, in the entire input image, the number
of lines in which the obtained count is at least
equal to said reference value a, and calculates the
ratio RH of the number of such lines to the entire number
of lines.

A step S43 similarly effects counting of black 
pixels in the vertical direction and a step S44 deter- 
mines the ratio RV of the number of lines in which the 
count is at least equal to the reference value a to the 
entire number of lines. 

A next step S45 compares thus determined RH
and RV, and the text image is identified as horizontally
written or vertically written, respectively if a condition
RH ≥ RV is satisfied or not.

The present embodiment explained above can 
also provide similar effects and advantages as those 
in the foregoing embodiments. 
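
A minimal sketch of the plural-line counting of steps S41 to S45 is given below. It assumes non-overlapping groups of `n` consecutive lines and takes the ratio over groups rather than over single lines; the text does not specify how the groups are formed or whether the reference value is scaled with the group size, so these points, together with the names `band_ratio` and `is_horizontal_banded`, are assumptions.

```python
def band_ratio(lines, a, n):
    """Ratio of n-line groups whose total black-pixel count is at least a."""
    groups = [lines[i:i + n] for i in range(0, len(lines), n)]
    counts = [sum(sum(line) for line in group) for group in groups]
    return sum(1 for c in counts if c >= a) / len(groups)


def is_horizontal_banded(image, a, n):
    """Compare the group ratios of the horizontal and vertical directions."""
    cols = [list(col) for col in zip(*image)]
    return band_ratio(image, a, n) >= band_ratio(cols, a, n)
```

Counting several lines at a time reduces the number of comparisons, at the price of a coarser projection.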

[1.5 Variation of 1.3: Embodiment with counting
from vertical direction]

The above-explained embodiment 1.3 can provide
similar effects and advantages to those in the
embodiment 1.1 even when the sequence of horizontal
counting and vertical counting is interchanged as
in the embodiment 1.2. The configuration of the
character recognition apparatus and the control sequence
remain the same as those of the embodiment 1.1,
shown in Figs. 1 and 3, and the only difference lies in
the step S2 shown in Fig. 6, in which the steps are
executed in the order of S33, S34, S31 and S32.

[1.6 Variation of 1.3: Embodiment for counting 
plural lines] 

The above-explained embodiment 1.3 can provide
similar effects and advantages to those in the
embodiment 1.1, even when the counting for each line
is replaced by counting of plural lines as in the foregoing
embodiment 1.4. In this embodiment, the configuration
of the character recognition apparatus and the
control sequence remain the same as those of the embodiment
1.1 shown in Figs. 1 and 3, and the only difference
lies in the step S2 in Fig. 6, in which the steps
S31 and S33 execute counting on plural lines at a time
instead of a single line.

[1.7 Variation of 1.4: Embodiment with counting 
from vertical direction] 

The above-explained embodiment 1.4 can also
provide similar effects and advantages to those of the
embodiment 1.1, even when the sequence of horizontal
counting and vertical counting is interchanged as
in the embodiment 1.2. In this embodiment, the configuration
of the character recognition apparatus and
the control sequence remain the same as those of the
embodiment 1.1 shown in Figs. 1 and 3, and the only
difference lies in the step S2 shown in Fig. 7, in which
the steps are executed in the order of S43, S44, S41
and S42.



[1.8 Variation of 1.1: Embodiment of taking lines in 
which the count is equal to or less than the 
reference value a] 

In the step S2 of the embodiment 1.1, shown in
Fig. 4, the condition "the count of black pixels equal
to or larger than the reference value a" may be replaced
by "the count of black pixels equal to or less
than the reference value a". In this embodiment, the
configuration of the character recognition apparatus
and the control sequence except for the step S2 are
the same as those of the embodiment 1.1 shown in Figs.
1 and 3.

Fig. 8 shows the details of the step S2 of the control
sequence in the present embodiment.

At first a step S51 counts the black pixels in each
horizontal line, and discriminates whether the
obtained count is not in excess of a reference value
a. Then a step S52 counts, in the entire input image,
the number of lines in which the count is not in excess
of the reference value a, and calculates the ratio RH
of the number of such lines to the entire number of
horizontal lines.

Then a step S53 counts the black pixels in each
vertical line, and a step S54 discriminates whether the
obtained count is not in excess of the reference value
a, counts the number of lines in which the count is not
in excess of the reference value a, and calculates the
ratio RV of such lines. A next step S55 compares RH
with RV, and identifies the text image as horizontally
or vertically written, respectively if a condition RH ≥
RV is satisfied or not.

The above-explained embodiment 1.8 can provide
similar effects and advantages to those in the
aforementioned embodiment 1.1.
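
The variation of steps S51 to S55 changes only the direction of the threshold test, as the following sketch shows. The list-of-rows 0/1 image, the names `low_count_ratio` and `is_horizontal_low`, and the comparison RH >= RV are assumptions made for illustration.

```python
def low_count_ratio(lines, a):
    """Fraction of lines whose black-pixel count does not exceed the reference value a."""
    counts = [sum(line) for line in lines]
    return sum(1 for c in counts if c <= a) / len(lines)


def is_horizontal_low(image, a):
    """Judge the writing direction from the ratios of low-count lines."""
    cols = [list(col) for col in zip(*image)]
    rh = low_count_ratio(image, a)     # blank gaps between horizontal rows raise RH
    rv = low_count_ratio(cols, a)
    return rh >= rv
```

With this test a small reference value is the natural choice, since the lines being counted are the nearly blank gaps between rows of characters.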

As explained in the foregoing, these embodiments
are capable of automatically identifying the
direction of row of characters in a text, thereby determining
whether the text image is written horizontally
or vertically. Consequently, in the character recognition
by reading a text image, the operator is no longer
required to enter whether the text is horizontally or
vertically written, and the operation of character recognition
is therefore improved.

[2.1 Embodiment of using different reference values 
for the horizontal and vertical directions] 

In this embodiment, reference values a, p are
employed respectively in the horizontal and vertical
directions as shown in Fig. 9, instead of the single
reference value a in the embodiment 1.1 shown in Fig.
2.

Said reference values a, p are determined after
the input of the image information as shown in a flow
chart in Fig. 10. In the following, the step S1 alone will
be explained since said flow chart differs only in said
step S1 from that shown in Fig. 3.



EP 0 472 313 A2



The reference values a, p are determined in the 
step S1 in the following manner. 

The reference values a, p respectively in the horizontal
direction and in the vertical direction are given
by:

a = (B × TV) / S_H + t_H,

p = (B × TH) / S_V + t_V,

where B is an initial reference value, TH and TV are the
total line numbers respectively in the horizontal and
vertical directions in a designated area of the text
image, and S_H, S_V, t_H, t_V are correction factors for a, p
(S_H ≠ 0, S_V ≠ 0).

The use of the height and width of the designated
area of the text image enables exact identification of
vertical or horizontal writing, without being influenced
by the vertical-to-horizontal ratio of the designated
area.
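
The computation of the two reference values in step S1 can be illustrated as follows. The sketch assumes the formulas are read as a = B·TV/S_H + t_H and p = B·TH/S_V + t_V; the function name `reference_values` and the default values of the additive correction terms are hypothetical.

```python
def reference_values(B, TH, TV, S_H, S_V, t_H=0.0, t_V=0.0):
    """Return (a, p), the reference values for the horizontal and vertical directions."""
    if S_H == 0 or S_V == 0:
        raise ValueError("S_H and S_V must be non-zero")
    a = B * TV / S_H + t_H   # a horizontal line contains TV pixels (the width)
    p = B * TH / S_V + t_V   # a vertical line contains TH pixels (the height)
    return a, p


# Hypothetical area of 100 horizontal lines and 200 vertical lines.
a, p = reference_values(B=0.5, TH=100, TV=200, S_H=1, S_V=1)
```

Because each reference value scales with the length of the lines it is compared against, a wide area does not bias the decision toward horizontal writing merely because its horizontal lines contain more pixels.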

Fig. 9 shows an example of identifying the horizontal
or vertical writing in the entire area of the text
image.

The reference values a, p determined in said step
S1 may be employed similarly in the embodiments 1.1
to 1.8 shown in Figs. 4 to 8.

[2.2 Embodiment employing maximum black pixel 
count] 

Fig. 11 shows the details of the step S2 shown in
Fig. 3, for identifying whether the text image is horizontally
or vertically written. A step S61 determines
the maximum values M_H, M_V of black pixel counts respectively
in the horizontal and vertical directions, and corrects
said maximum values with the horizontal line
number TH, vertical line number TV and correction
coefficients as indicated in Fig. 11. A next step S63
compares thus corrected maximum values of the
black pixels in the horizontal and vertical directions,
and the text image is identified as horizontally or vertically
written respectively if the maximum value in the
horizontal direction is larger or smaller than that in the
vertical direction.

The configuration of the image processing
apparatus in the present embodiment is the same as that
shown in Fig. 1.
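
A sketch of the comparison of maximum projection values follows. The exact correction is given only in Fig. 11, which is not reproduced here, so the sketch substitutes a simple assumed correction: each maximum is divided by the number of pixels in a line of that direction. The name `is_horizontal_by_max` is hypothetical, and a tie, which the text does not address, is reported as not horizontal.

```python
def is_horizontal_by_max(image):
    """Compare length-normalized maxima of the horizontal and vertical projections."""
    th = len(image)                                # TH: number of horizontal lines
    tv = len(image[0])                             # TV: number of vertical lines
    m_h = max(sum(row) for row in image)           # M_H: maximum horizontal count
    m_v = max(sum(col) for col in zip(*image))     # M_V: maximum vertical count
    # Assumed correction: normalize by line length (TV pixels per row, TH per column).
    return m_h / tv > m_v / th
```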

[3.1 Embodiment of assuming both horizontal and
vertical writings and determining the direction from
the similarity of the results of recognition]

Fig. 12 shows the details of the step S2, shown 
in Fig. 3, for discriminating whether the text image is 
horizontally or vertically written. 

A step S71 extracts a rectangular area according
to a histogram in the horizontal direction (Fig. 13),
then prepares a histogram in the vertical direction in
said extracted area, and extracts one or plural rectangular
areas (Fig. 14). Fig. 14 shows only the rectangular
areas extracted from the first row. Then a step
S72 extracts a rectangular area according to a histogram
in the vertical direction (Fig. 15), prepares a histogram
in the horizontal direction in said extracted
area, and extracts one or plural rectangular areas
(Fig. 16). A next step S73 recognizes the rectangles
extracted in the steps S71, S72, and a step S74 determines
whether the text image is horizontally or vertically
written, according to the level of similarity of said
extracted rectangles.

[3.2 Embodiment of assuming both horizontal and 
vertical writings and determining the writing 
direction from the vertical-to-horizontal ratio of 
extracted rectangle] 

Fig. 17 shows the details of the step S2, shown 
in Fig. 3, for determining whether the text image is 
horizontally or vertically written. 

A step S81 extracts a rectangle according to the 
histogram in the horizontal direction (Fig. 13), then
prepares a histogram in the vertical direction, in said 
rectangle, and extracts one or plural rectangles (Fig. 
14). Fig. 14 only shows the rectangles extracted from 
the first row. Then a step S82 extracts a rectangle 
according to the histogram in the vertical direction 
(Fig. 15), then prepares a histogram in the horizontal 
direction, and extracts one or plural rectangles (Fig. 
16). A next step S83 calculates the vertical-to-hori- 
zontal (V-H) ratio of the rectangles extracted in the 
steps S81 and S82. 

In the example illustrated in Figs. 14 and 16, the 
V-H ratio is apparently closer to 1:1 in the rectangle 
extracted in the step S81. 

A next step S84 identifies the text image as hori- 
zontally or vertically written respectively if the V-H 
ratio closer to 1:1 is obtained in the step S81 or not. 
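
The decision of step S84 can be sketched as below. How closeness to 1:1 is measured over several rectangles is not stated in the text; the sketch assumes the mean absolute logarithm of the height-to-width ratio, so that a 2:1 and a 1:2 rectangle are equally far from square. Boxes are assumed to be (top, bottom, left, right) tuples with end-exclusive coordinates, and the names `squareness` and `is_horizontal_by_aspect` are hypothetical.

```python
import math


def squareness(boxes):
    """Mean |log(height / width)| over the boxes; 0 means perfectly square."""
    total = 0.0
    for top, bottom, left, right in boxes:
        total += abs(math.log((bottom - top) / (right - left)))
    return total / len(boxes)


def is_horizontal_by_aspect(boxes_s81, boxes_s82):
    """True if the rectangles from step S81 are closer to 1:1 than those from S82."""
    return squareness(boxes_s81) < squareness(boxes_s82)
```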

[4.1 Embodiment for a text including both 
horizontally and vertically written areas] 

Fig. 18 shows an example of the input text image,
to the character recognition apparatus, including ver- 
tically and horizontally written areas in mixed manner. 

In Fig. 18 there are shown a text image 31 read 
by the input unit 2; a projection 32 of said text image 
(counts of black pixels) obtained by scanning said text 
image in the horizontal direction; a projection 33 of 
said text image obtained by scanning in the vertical 
direction; lines 34, 35 respectively corresponding to 
reference values a, p; horizontally written areas 36,
42 identified by the character row/area determination 
unit; a vertically written area 37; characters 38 - 40 in 
said areas; and an area 41 identified as free of charac- 
ters. 

In the example shown in Fig. 18, the input text
image 31 contains both the horizontally and vertically
written texts.

Fig. 19 is a flow chart of the control sequence
executed by the CPU 3 in the present embodiment,
and a corresponding control program is stored in the
ROM 4.

At first a step S100 enters a text image, set by the
operator, through the input unit (scanner) 2 and stores
said image in the memory 5. In a next step S101, the
character row determination unit 6 extracts horizontally
written areas and vertically written areas from the text
image stored in the memory 5, by determining projections
(counts of black pixels) in the horizontal and vertical
directions on the text image stored in the memory
5 and comparing the numbers of lines in which said
count is at least equal to the reference value a or p.

Fig. 20 shows the details of said step S101 executed
by the character row determination unit 6. At first
a step S111 counts the black pixels in each line, and
determines an area or areas in which the obtained
count is at least equal to the reference value a. In the
example shown in Fig. 18, such an area is indicated
by 36.

A next step S112 similarly determines an area or
areas in the vertical direction. In Fig. 18, such areas
are indicated by 36, 37.

A step S113 then determines the horizontally written
areas and vertically written areas in the text
image.

The steps S111 and S112 classify the text image
into areas of the following four categories shown in Fig.
21:

Area A: where the count is at least equal to the
reference value both in the horizontal and vertical directions;

In such area A, histograms are prepared again in
said area, and the area is identified as horizontally or
vertically written respectively if the maximum value of
the histogram minus the reference value is larger in
the horizontal direction or in the vertical direction.

Area B: where the count is at least equal to the
reference value only in the horizontal direction;

Such area is identified as horizontally written.
Thus the area 36 in Fig. 18 is identified as horizontally
written. Since the areas 36 and 42 are mutually adjacent
and have the same writing direction, the two
areas are combined as an area.

Area C: where the count is at least equal to the
reference value only in the vertical direction;

Such area is identified as vertically written. Thus
the area 37 in Fig. 18 is identified as vertically written.

Area D: where the count is less than the reference
value both in the horizontal and vertical directions;

Histograms are again prepared in such area in
the horizontal and vertical directions, and the area is
identified as horizontally or vertically written respectively
if the maximum value of the histogram minus the
reference value is larger in the horizontal or vertical
direction. A zero histogram indicates absence of
characters in the area, as in the case of the area 41 in Fig.
18.
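
The four-way classification and the follow-up decision for areas A and D can be sketched as follows. The sketch assumes that the horizontal projection is compared with a and the vertical projection with p, that an area is supplied as a list-of-rows 0/1 sub-image, and that a tie in the final comparison (which the text does not cover) is resolved as vertical; the names `classify` and `resolve` are hypothetical.

```python
def classify(h_count, v_count, a, p):
    """Return the category A, B, C or D for one pair of projection values."""
    h = h_count >= a          # horizontal projection reaches its reference value
    v = v_count >= p          # vertical projection reaches its reference value
    if h and v:
        return "A"
    if h:
        return "B"            # identified as horizontally written
    if v:
        return "C"            # identified as vertically written
    return "D"


def resolve(area, a, p):
    """Decide the direction of an area of category A or D from fresh histograms."""
    h_hist = [sum(row) for row in area]
    v_hist = [sum(col) for col in zip(*area)]
    if max(h_hist) == 0:
        return "empty"        # zero histogram: no characters in the area
    if max(h_hist) - a > max(v_hist) - p:
        return "horizontal"
    return "vertical"
```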

The above-mentioned reference values are corrected
by the width and height of the text image.

After the identification of horizontally and vertically
written areas, a step S114 determines the order
of sequence of recognition of such areas, for example
from top to bottom according to the magnitude of
the coordinate value.
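
The ordering of step S114 amounts to a sort on a coordinate value, as in the following sketch. The `Area` record, its fields and the use of the left coordinate to break ties are assumptions introduced for the example; the text only gives top-to-bottom ordering as one possibility.

```python
from typing import NamedTuple


class Area(NamedTuple):
    top: int          # vertical coordinate of the upper edge
    left: int         # horizontal coordinate of the left edge
    direction: str    # "horizontal" or "vertical"


def recognition_order(areas):
    """Order areas from top to bottom; ties are broken from left to right (assumed)."""
    return sorted(areas, key=lambda area: (area.top, area.left))


# Hypothetical layout: a heading above two areas placed side by side.
layout = [
    Area(50, 40, "vertical"),
    Area(0, 0, "horizontal"),
    Area(50, 0, "horizontal"),
]
ordered = recognition_order(layout)
```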

Again referring to the flow chart shown in Fig. 19,
in case of a horizontally written area, the sequence 
proceeds to a step S103 in which the pre-processing 
unit 7 effects pre-processing for horizontal writing, 
including character extraction by taking horizontal 
projection to extract a character row and then taking 
vertical projection on the extracted row or rows; nor- 
malizations on the position, inclination and density of 
character; and noise elimination. 

On the other hand, if the step S102 identifies a 
vertically written area, the sequence proceeds to a 
step S104 in which the pre-processing unit 7 executes 
pre-processing for vertical writing, including character 
extraction by taking vertical projection to extract a 
character row and then taking horizontal projection on 
thus extracted row or rows; normalizations on the 
position, inclination and density of character; and 
noise elimination. 

In a next step S105, the recognition unit 8 extracts
features, such as contours, from the unknown character
pattern pre-processed in the pre-processing unit 7.
The dictionary 9 stores features of the standard patterns
of all the characters to be recognized. The features
of said unknown character pattern are matched with
the features of the standard patterns stored in the
dictionary 9, and the most probable character is stored
as the result of recognition in the memory 5. All
the characters in the input text image are similarly
recognized and stored in the memory 5.
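
The matching against the dictionary 9 may be pictured by the following sketch. The text does not state how features are represented or compared; the numeric feature vectors, the squared Euclidean distance and the function name used here are assumptions chosen only to make the example concrete.

```python
def recognize(features, dictionary):
    """Return the character whose standard pattern is closest to `features`.

    `dictionary` maps each character to the feature vector of its
    standard pattern. Closeness is measured by squared Euclidean
    distance, which is an assumption of this sketch.
    """
    def distance(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return min(dictionary, key=lambda ch: distance(features, dictionary[ch]))
```

In such a sketch the character returned for each extracted pattern would then be stored as the result of recognition.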

Then the sequence proceeds to a step S106, and 
returns to the step S102 if any unrecognized area still 
exists, or proceeds to a step S107 if all the areas have 
been recognized. The step S107 displays the results 
of recognition, stored in the memory 5, on the display 
unit 10. The process of output of characters by the 
printer 13 will be omitted from the description. 

As explained in the foregoing, the present embo- 
diment provides a character recognition apparatus 
adaptable to any text that is horizontally or vertically 
written or contains both writings. 

[4.2 Embodiment of requesting confirmation by 
operator on the identification of horizontal or 
vertical writing and on the order of recognition] 

Fig. 22 shows the details of the step S101 in the
present embodiment.

Steps S111 - S114 determine the horizontally
and vertically written areas in the same manner as in
the embodiment 4.1. Then a step S115 determines
the property (vertical or horizontal writing) of the
extracted areas and the order of recognition thereof,
through request to and confirmation by the operator.



Claims 

1. An image processing method comprising steps 
of: 

counting image data, on the input image 
information, in the horizontal and vertical direc- 
tions; 

comparing the results of said counting with
a threshold value; and

determining the direction of row of charac- 
ters in said input image information, according to 
the results of said comparison. 

2. A method according to claim 1, wherein said
threshold value includes a first threshold value for
the horizontal direction and a second threshold
value for the vertical direction.

3. A method according to claim 1, wherein said
threshold value corresponds to the size of area of
said image information.

4. An image processing method comprising steps 
of: 

counting image data, on the input image 
information, in the horizontal and vertical direc- 
tions; 

comparing maximum values in the hori- 
zontal and vertical directions in said counting of 
image data, and 

determining the direction of row of charac- 
ters in said input image information according to 
the result of said comparison. 

5. A method according to claim 4, wherein the
results of said counting of image data are corrected
according to the size of area of said image
information.

6. An image processing method comprising steps 
of: 

counting image data, on the input image
information, in the horizontal and vertical
directions;

deriving a vertical-to-horizontal ratio of a 
rectangular area of character information extrac- 
ted according to the results of said counting; and 

determining the direction of row of characters
in said image information according to the thus
derived vertical-to-horizontal ratio.

7. An image processing method comprising steps
of:

counting image data, on the input image
information, in the horizontal and vertical
directions; and

deriving the direction of row of characters
in a partial area in said image information,
according to the results of said counting.

8. An image processing apparatus comprising:

counter means for counting image data, on
the input image information, in the horizontal and
vertical directions;

comparator means for comparing the
results of counting of image data by said counter
means with a threshold value; and

direction determining means for determining
the direction of row of characters in said input
image information, according to the result of
comparison by said comparator means.

9. An apparatus according to claim 8, wherein said
threshold value includes a first threshold value for
the horizontal direction and a second threshold
value for the vertical direction.

10. An apparatus according to claim 8, wherein said
threshold value corresponds to the size of area of
said image information.

11. An image processing apparatus comprising:

counter means for counting image data, on
the input image information, in the horizontal and
vertical directions;

comparator means for comparing the
maximum values in the horizontal and vertical
directions in the image data counted by said
counter means; and

determination means for determining the
direction of row of characters in said input image
information, according to the result of comparison
by said comparator means.

12. An apparatus according to claim 11, further
comprising correction means for correcting the results
of counting by said counter means, according to
the size of area of said image information.

13. An image processing apparatus comprising:

counter means for counting image data, on
the input image information, in the horizontal and
vertical directions;

derivation means for deriving a vertical-to-horizontal
ratio of a rectangular area of character
information extracted according to the results of
counting by said counter means; and

determination means for determining the
direction of row of characters in said image
information, according to the vertical-to-horizontal
ratio thus derived by said derivation means.

14. An image processing apparatus comprising: 

counter means for counting image data, on 
the input image information, in the horizontal and 
vertical directions; and 

derivation means for deriving the direction
of a row of characters in a partial area in said
image information, according to the results of
counting by said counter means.

15. Text processing apparatus comprising means for
inputting text information and means for determining
whether said text information comprises
text written parallel or perpendicular to the axis of
the characters of said text.

16. Apparatus according to claim 15, comprising 
means for scanning in an image including said 
text, and for recognising characters therefrom. 

17. Apparatus according to claim 16, comprising
means for recognising separate areas of said
images corresponding to separately oriented text
areas.

18. Apparatus for determining the orientation of text
comprising means for statistically analysing the
orientation thereof.

19. Apparatus for recognising text orientation
comprising means for recognising the orientation of
the ragged edge of text.

20. A document scanner comprising means for performing
print density histogram derivation in two
orthogonal directions.


